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Abstract 

We introduce and analyze a natural geometric version of Renegar's condition number 1Z 
for the homogeneous convex feasibility problem associated with a regular cone C C R™. 
Let Gr„^ m denote the Grassmann manifold of m-dimensional linear subspaces of R" and 
consider the projection distance d p (Wi,W-2) ■= ||ITvki — n^ll (spectral norm) between 
Wi, W2 <E Gr„. m , where Tlwi denotes the orthogonal projection onto W%. We call ^(W) := 
max{dp(W 1 TF') -1 \ W <G S m } the Grassmann condition number of W € Gr„ m , where the 
set of ill-posed instances S m C Gr„ jm is defined as the set of linear subspaces touching C. 
We show that ifW = im(A T ) for a matrix A e R mxr \ then tf(W) < 11(A) < c g{W) k(A), 
where k(A) = \\ A\\ WA^ \\ denotes the matrix condition number. This extends work by 
Bclloni and Freund in Math. Program. 119:95-107 (2009). Furthermore, we show that 
^(W) can as well be characterized in terms of the Riemannian distance metric on Gr ra rn . 
This differential geometric characterization of ^(W) is the starting point of the sequel 
[arXiv: 11 12.2603] to this paper, where the first probabilistic analysis of Renegar's condition 
number for an arbitrary regular cone C is achieved. 

Key words: convex programming, perturbation, condition number 

1 Introduction 

It is by now a well established fact [26, 27, 30, 17, 15, 11, 12] that the running time of a 
variety of algorithms in linear programming can be efficiently bounded in terms of a notion of 
condition. The condition is defined as a measure of sensitivity of the output with respect to 
small perturbations of the input. Different variants of this notion exists: the most common is 
the one originally introduced by Jim Renegar [25, 26, 27]. 

The analysis of the probability distribution of the condition of random input data is a 
thoroughly studied subject for a variety of numerical problems, compare the recent survey [7] 
for references. It has recently received increased attention through the concept of smoothed 
analysis, introduced by Spielman and Teng [28], who also managed to perform a smoothed 
analysis of Renegar's condition number for linear programming [13]. 



'Institute of Mathematics, University of Paderborn, Germany. Partially supported by DFG grant BU 
1371/2-1 and DFG Research Training Group on Scientific Computation GRK 693 (PaSCo GK). 



The motivation of the present work is to extend such probabilistic analyses to general con- 
vex programming, notably to second-order and semidefinite programming. Renegar's condition 
number is hard to analyze directly. In fact, behind the analysis in [13] there is an intermediate 
concept, the so-called GCC-condition number [18, 10] tailored to the LP cone R", that has 
nice geometric characterizations that facilitate its probabilistic analysis, see [9, 2]. All known 
probabilistic analyses of condition numbers for linear programming heavily rely on the product 
structure of the cone K+ x • • • x M + and thus cannot be extended to general convex cones. In 
this paper we introduce a coordinate-free, geometric notion of condition of independent inter- 
est, that allows to overcome this difficulty at the price of working in the intrinsic geometric 
setting of Grassmann manifolds. 

1.1 Renegar's condition number 

A regular cone C C W 1 is a closed convex cone with nonempty interior that does not contain 
a nontrivial linear subspace. The dual cone 1 of C is defined as C := {z G R n | Vx G C : z T x < 
0}. If C is regular, then C is regular as well. We call C self-dual if C = —C. Important cones 
for applications are, besides the LP case C = , the second order cones C = C ni x . . . x C nk , 
where C n := {x G M. n \ x n > (x\ + • • • + x 2 ^) 1 / 2 } , and the cone of positive semidefinite 
matrices. All these cones are self-dual. 

In the following we fix a regular cone C C W l . Throughout the paper we assume that 
1 < m < n. The homogeneous convex feasibility problem is to decide for a given matrix 
A G M mxn the alternative 

3x£l"\{0} s.t. Ax = 0,xeC, (P) 
3y G R m \ {0} s.t. A T y G C . (D) 
We define the sets of primal and dual feasible instances with respect to C, respectively, by 

Tl := {A G R mxn | (P) is feasible} = {A | ker(A) n C ^ {0}} , (1) 

T» := {A G R mxn | (D) is feasible} = M? xn U {A | im(^ T ) n C ± {0}} , (2) 

where R™ xn denotes the set of rank-deficient matrices in M mxn . Here are the most relevant 
properties of the sets J-^ and J-^: they are closed, both their boundaries coincide with the set 
of ill-posed inputs S R := J 7 ^ n , and R mxn = u (We shall state and prove related 
statements in Section 1.2). We also note that J 7 ^ and J-^ are invariant under the action of 
the general linear group GL(m) on R mxn by left multiplication. 
Renegar's condition [25, 26, 27] is defined as the function 

K:=K c :R mxn \{0}^[l,^], TZ C (A) := -M-^ , (3) 

where \\A\\ denotes the spectral norm, and d(A,Ti R ) = min{||yl — A'\\ \ A' G S H }. One can 
also characterize 7£(j4) _1 as follows: 

K( .,^ mM { r |i, A .„ £ ,piK(^-^ *At%) 

That is, 1Z(A)~ l is the maximum r > such that all pertubations /S.A of norm at most r\\A\\ 
do not change the feasibility status of A. 

1 Somc authors call C the polar cone and — C the dual cone. 
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1.2 The Grassmann condition number 

The Grassmann manifold Gr n>m is defined as the set of m-dimensional linear subspaces W 
of R n . It is well known that Gr„ )m is a compact smooth manifold on which the orthogonal 
group 0(n) acts transitively, see for instance [6]. 

Following (1) and (2), we define the sets of m-dimensional primal feasible subspaces, and 
dual feasible subspaces with respect to the regular cone C, respectively, by 

V m {C) := {H'eGv | W ± nC^{0}} , V m (C) := {W G Gr„, m | FFnC/{0}}. 

Note that, unlike in (P) and (D), there is no structural difference between primal and dual 
feasibility The primal feasibility of W with respect to C just means the dual feasibility of 
W 1 - with respect to C. In terms of the involution 2 

L m : Gr n ^ m — >■ Gr„ jra „ m , W i— >■ IF -1 , 

this can be expressed as tm(V m (C)) = P n _ m ((7). 

We claim that Gr„ im = V m (C) U V m (C). For this recall the well-known theorem on 
alternatives 3 , which for C = R™ is also known as Farkas' Lemma [16]: for W G Gr„ irrt we have 

w n int(C) / ^nc = {0}. (4) 

Now W G" V m {C) means IF- 1 n C = {0}, which by (4) is equivalent to W n int(C) / 0. This 
in particular implies W 6 V m (C). 

By a similar reasoning we obtain the following characterization of the set of m-dimensional 
ill-posed subspaces E m (C) := V m (C) PI V m (C) with respect to C: 

E m (C) = {VF G Gr n , m | IF n C / {0} and n int(C) = 0} . (5) 

Thus E m (C) consists of the subspaces IF 6 Gr„ i?n touching the cone C. As for the involu- 
tion i m , we obtain the following duality relations: 

(P m (C)) = I? 

n—rn 

(C), im (P m (C)) = P„_ m (C), (C). (6) 

The projection distance d p (Wi,W2) of IF r i,W2 G Gr„ jm is defined as the spectral norm 
d p (Wi,W2) '■= W^-Wi ~ nw 2 ||, where 11^ denotes the orthogonal projection onto Wj, cf. [19, 
§2.6]. Clearly, this defines a metric and a corresponding topology on Gr n)m . We will see, 
cf. (9), that d p (W^-,W^-) = d p (Wi,W2) so that the involution t m preserves the projection 
distance. 

The proof of the following basic topological result is provided at the end of the paper. 

Proposition 1.1. (1) The sets V m {C) andV m (C) are both closed subsets of Gv n jn . 

(2) The boundaries ofV m {C) andV m (C) both coincide with £ m (C). 

We define now the Grassmann condition of IF G Gr rajm as the inverse distance of IF to the 
set S m (C) of ill-posed subspaces, measured by the projection distance. 

2 In this context we interpret i m as an involution, since i„- m o t m = id m and i m o t n _ m = id n _ m , where id m 
denotes the identity map on Gr„ jm . 

3 Although this theorem of alternatives is folklore, we could not find a perfect reference for it in the literature. 
See for example [4, Thm. 3] for a complex version of (4). The given proof is easily adapted to the real case. 
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Definition 1.2. The Grassmann condition with respect to the regular cone C C M ra is defined 
as the function 

-K1.0O], ^ );= W ^ (C)) , 

where dp(W,£ ro (C)) := min{d p (VF, W) | W G E m (C)}. 

Because of (6) and since the involution i m preserves the projection distance, we have 

V C (W) = V d ( w± ) ■ ( 7 ) 

When the reference cone C is clear from the context we simply write ^ = and similarly 
for V m ,7J m , and 

1.3 Main results 

We first relate the Grassmann condition number to Renegar's condition number. Again we 
fix a regular cone C C W 1 . 

In the following let M™ xri denote the set of m x n- matrices of full rank m. A subspace 
W G Gr ni?n can be represented by a matrix .A G M™ xri in the form — im.(A^) — ker(yl)"'". 
The numerical quality of the matrix A is measured by its condition number k(A) = \\A\\\\A'\\, 
where A^ G M™ xm stands for the Moore-Penrose pseudoinverse of A. 

We call a matrix B G R mxn balanced iff BB T = I m . It is easy to see that k(B) = 1 iff 
B is balanced (cf. Lemma 2.1). The next result states that ^(W) equals Renegar's condition 
number 71(B) for a balanced representation B of the subspace W. 

Theorem 1.3. For B G R mxn balanced and W = im(B T ) we have 'jf(W) = 11(B). 

Now we address the question to what extent 1Z(A) deviates from ^(W), when we represent 
the subspace W by a nonbalanced matrix A such that W = im(^4 T ). As one may expect, this 
is quantified by the matrix condition number k(A). 

Theorem 1.4. For A G M™ xn and W = im(A T ) we have 

&(W) < 71(A) < k(A) • V(W) . 

By Theorem 1.3, the left-hand inequality states that 7Z(B) < 7Z(A) where B is balanced 
and im(B ) = im(A ). The right-hand inequality expresses the fact that a large condition 
71(A) is either caused by a large ^(W), i.e., W meeting/missing C at small angle, or caused 
by a large k(A), i.e., a badly conditioned matrix A representing the subspace W. 

Remark 1.5. As a preconditioning process, we may replace the input matrix A by a balanced 
matrix B such that im(A T ) = im(B T ). For instance, this may be achieved by a Gram-Schmidt 
orthogonalization of the columns of A. Then, for the three families of cones C corresponding to 
linear, second-order or semidefinite programming, we may apply the primal-dual interior-point 
method of [31] to decide the alternative (P)/(D) on input B with a number of interior-point 
iterations bounded by O (y/n(ln(n c ^' (W)) . This is true since ^(W) = 71(B) according to 
Theorem 1.3. 
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Theorem 1.4 allows to break up the probabilistic study of TZ into the study of the geometric 
condition ^ and the matrix condition n. In particular, for random matrices A we have 
Elogft(A) < Elog/«(A) + Elog<^(^). 

In the forthcoming paper [1] we will, based on methods from differential and spherical 
convex geometry, give tight bounds on the tail probability of ^ and the expectation E log ^{A) 
for random A G ]j mxn with independent standard Gaussian entries, for any regular cone C. 

We will see next that the Grassmann condition c if(W) can be characterized in terms of 
angles. The angle <(x,y) between two vectors x,y G W l \ {0} is defined by <(x,y) := 
arccos (pmrar)) where || || denotes the Euclidean norm. We define the angle between x and 
a subspace W G Gr„ iTn by <(x, W) := mm{<(x,y) \ y £ W \ {0}}. It is easy to see that 
<(x, W) = arccos (||ripi/(x)||/||x||), where Ily^ denotes the orthogonal projection onto W. 
Note that <(x, W) G [0, vr/2]. If W n C = {0}, we define the angle between the cone C and 
the subspace W via <(C, W) := min{<(x, W) \ x G C \ {0}}. 

Proposition 1.6. We have 

<# (w) -i = U^<C,W) , ifW(z V m (C) , 
" \sin<(C',iy ± ) , ifWeV m (C). 

Remark 1.7. In the dual feasible case, the Grassmann condition as characterized in Proposi- 
tion 1.6 was already considered by Belloni and Freund in [3, eq. (3)] and also the inequalities 
in Theorem 1.4 were derived. What is missing in [3] is the treatment of the primal feasible 
case, and the geometric viewpoint in the Grassmann manifold, which leads to a completely 
transparent picture with regard to duality. 

There is a natural Riemannian metric on the Grassmann manifold Gr„ iJn that is invariant 
under the action of 0(n) and which is uniquely determined up to a scaling factor [20]. The 
geodesic distance d g (Wi, W2) between W\, W2 G Gr njjn is defined as the minimum length of a 
piecewise smooth curve in Gr„ jm connecting W\ with Wi- One can nicely express d g (Wi, W2) 
in terms of the principle angles between these subspaces, see (8) below. 

The following result states that measuring the distance of W to S m (C) either with the 
projective distance or the geodesic distance leads to the same result. The resulting differential- 
geometric characterization of the Grassmann condition is the key to the probabilistic analysis 
of in [1]. We write d g (W, E m ) := mm{d g (W, W) \ W G S m }. 

Theorem 1.8. We have d p (W, S m ) = sind g (W, S m ) for W G Gr„ i?n . 

The remainder of the paper is devoted to the proofs of the results stated in this section. 
The proof of Theorem 1.8 will be given in Section 3, whereas the proofs of the remaining 
statements will be given in Section 4. The paper closes with a comparison of the Grassmann 
condition with the GCC condition for the cone C = . 

Acknowledgments: We are grateful to the criticism of the anonymous referees, which led 
to an improved presentation of the paper. 

2 Preliminaries 

We first recall the fundamental notions of principle angles between linear subspaces, going 
back to Jordan [21]. 
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Let W U W 2 € Gr„ >m and let B { G R mxn be balanced such that Wi = im(Bf), i.e., the 
rows of Bi form an orthonormal basis of Wj. Let o"i > . . . > a m denote the singular values of 
B\B\. Note that o\ = H-Bi-BjH < ||-Bi|| H-BjH = 1. The principal angles cci, . . . , a m between 
W\ and W 2 are defined as on := arccoso-j G [0,7r/2], cf. [5]. 

The principal angles depend only on the pair Wi, W 2 of subspaces. Their relevance derives 
from the known fact [21, 32] that two pairs of subspaces in Gr njm lie in the same 0(n)-orbit 
iff they have the same (ordered) vector a = (a±, . . . ,a m ) of principal angles. It follows that 
any orthogonal invariant metric on Gr n>m is expressible in terms of the principle angles. In 
fact, the following is true: 



dp(Wi, W 2 ) = sin ||ci!||oo = sin max ccj, d g (Wi, W 2 ) = \\a\\ 2 = J ct\ + . . . + a 2 m . (8) 



See [19, §12.4.3] or [29, §5.3] for a proof of the first equality and [32] for a proof of the second 
one. It follows that dp{W u W 2 ) < smd g (Wi, W 2 ). 

The nonzero principal angles between W\ and W 2 coincide with the nonzero principal 
angles between and W 2 ^~, cf. for example [22, Thm. 3]. Combined with (8), this yields 



We shall need the polar decomposition of a matrix. For any A £ M™ xn we define S = 
VAA T and B = S^A. Then S is positive definite and BB T = S~ 1 AA T S- 1 = I, hence B 
is balanced. One calls A = SB the polar decomposition of A, cf. [19, §4.2.10]. It is clear 
that A and S have the same singular values [19, §2.5.3] u\ > ■ ■ ■ > a m > 0. In particular, 
1 1 .A 1 1 = <7i = I (S 1 1 1 and \\A^\\ = cr" 1 = We shall call B the balanced approximation 

of A. Replacing A by its balanced approximation B may be interpreted as a preconditioning 
process. 

A linear map between Euclidean vector spaces is called isometrical iff it preserves the inner 
product between vectors. The next lemma is well known and summarizes some of the defining 
properties of balanced matrices. We include a proof for lack of a suitable reference. 

Lemma 2.1. For a matrix A € K™ xra the following conditions are equivalent: 

(1) A is balanced, 

(2) the map A T : R m — > R n is isometrical, 

(3) A restricted to ker(A)- L is isometrical, 



In this case, A A equals the orthogonal projection onto im.(A ) = ker(A) . 

Proof. A T is isometrical iff (A T ei) T (A T ej) = <5y for all i,j. This is equivalent to ejAA T ej = 
5ij, or AA T = I m , which means that A is balanced. We have thus verified the equivalence of 



We show now the equivalence of (1) and (3). Suppose that A is balanced. Then we have 
(AA T y\) T AA T y 2 = yjy 2 = (A T yi) T (A T y 2 ) for all y\,y 2 £ M m . Hence the map A restricted 
to kev(A) 1 - = imj4 T is isometrical. 




dp{Wt, w£) = d p (W u W 2 ), d g (Wt, w£) = d g (W 1 , W 2 ). 



(9) 



(4) <A) = \\A\\ = 1. 



(1) and (2). 
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To see the converse, suppose that (AA T yi) T AA T = {A T yi) T {A T y2) ■ This means that 
AA T AA T = AA T . Since AA T is invertible we get AA T = I rn . 

To see the equivalence of (1) and (4) suppose that A is balanced. Then \\A\\ = 1 by (2) 
and || At || = 1 by (3). Conversely, assume that ||A|| = ||At|| = 1. Since ||A|| is the largest 
and 1 1 A^ || _1 is the smallest singular value of A, it follows that A = U (I m 0) V for orthogonal 
matrices U G O(m) and V G O(n), cf. [19]. Hence (2) and thus (1) is true. 

For the last assertion, let x = x\ + X2 with x\ G ker(A) and X2 G im(A T ), say X2 = A T y2- 
Then A T Ax\ = and A T Ax2 = A T AA T y2 = A T y2 = X2- Hence A T A equals the orthogonal 
projection onto im(A T ). □ 

Remark 2.2. Recall that M mxn denotes the set of rank deficient matrices in R mxn . The 
Eckart- Young Theorem [14], see also [19, §2.5.5], states that d(A,M mxn ) = HA^- 1 , where At 
denotes the Moore-Penrose pseudoinverse of A. This implies that d(A,M™ xn ) = 1 iff A is 
balanced. 



3 Distances in the Grassmann manifold 

We show here that different choices of distances in the Grassmann manifold lead to the same 
notion of Grassmann condition. 

Lemma 3.1. Let W\,W2 G Gr njm such that W := W± n W2 has the dimension m— 1 and 
let the line L{ denote the orthogonal complement of W in W%. Then the -principal angles 
«!<...< a m between W\ and W2 are given by a± = . . . = a m _i = and a m = <(L±, L2). 
Moreover, d p (Wi, W2) = sina m and d g (W±, W2) = a m . 

Proof. Let Xi G Li, i = 1,2, such that ||xi|| = ||x2|| = 1 and <{xi,X2) = <(L±,L2). If 
the rows of Bi G R mxn consist of Xi and an orthonormal basis of W, then E>i is balanced 

(X^ X2 \ 

1 J . Hence the vector of principal angles 

U l m —i J 

between W\ and W2 equals (0, . . . , 0, a m ), where a m = arccos \x\ X2\- Equation (8) implies 
d p (W\, W2) = sinQ m and d g (W\,W2) = a m as claimed. □ 

Lemma 3.2. For x € R n \ {0} set £ x := {W G Gr n>m | x G W'}. Then we have for 
W G Gr n>m 

dp(W,S x ) = smd g (W,S x ) = sin<(x, W) . 

Proof. Let w G W \ be such that 8 := <(x, W) = <(x,w). Without loss of generality we 
may assume ||x|| = \\w\\ = 1. We have the orthogonal decomposition W = W + Mw, where 
W := W n w 1 . Note that x G since x — cos(#) w G W 1 - . Hence we have an orthogonal 
decomposition W := W + Rx and Lemma 3.1 implies that 

d P {W,£ x ) < smdg(W,S x ) < sind g (W,W') = sinO. 

It remains to prove that d p (W, £ x ) > sin^, For this, take any space W G £ x and put 
W := W' n x^. Then we have an orthogonal decomposition W' = W + M.x. In order to 
calculate the principal angles between W and W, let b±, ... , 6 m _i be an orthonormal basis of 
W and consider the balanced matrix B G W nxn with the rows x, b±, . . . , b m -i. Then we have 

Bw = (x T w, x T bi, . . . , x T b m -\) = (x T w, 0, . . . , 0) 
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since x G W . Therefore, \\Bw\\ = x T w = cos 9. Let B G W mxn denote a balanced matrix 
consisting of the first row w and an orthonormal basis of W. Using the fact that the smallest 
singular value o~ m of the matrix BB is given by u m = miri|| y || =1 \\BB y\\ (cf. [29, Thm. 1.4.3]), 
we conclude a m < \\BB ei|| = = cos 9. If we denote by a = (ai, . . . ,a m ) the vector 

of principal angles between W and W, we get ||a||oo > arccos<r m > and hence, using (8), 
d g (W',W) = sin Halloo > sin 9. □ 

Proof of Proposition 1.6. Note that by definition, 

T> m (C) = {W G Gr n , m \W'nC? {0}} = Uxec\{0} £ * ■ 
Combinig this with Lemma 3.2 we obtain 

d p (W,V m (C))= inf d p {W,£ x )= inf sin <(x, W) = sin <(C, W). (10) 
xec\{o} xec\{o} 

Moreover, Proposition 1.1 implies d p (W, S m (C)) = d p (VF, V m {C)) in the case W G V m (C). □ 
Proo/ o/ Theorem 1.8. Arguing as for (10) with d s instead of d p and using Lemma 3.2, we get 

smd g {W,V m {C)) = inf smd g (W,£ x ) = inf d p {W, S x ) = d p (W, V m {C)). 
xeC\{o} xec\{o} 

This implies the assertion in the case W G V m {C). 

For the case W G V m (C) we recall that the involution i m - Gr nm — >■ Gr nn _ m , 
is an isometry both for d p and c£ ff , cf. (9). This implies 

d p (W ± , E n _ m (6)) = dp(W, E ro (C)), d 9 (^, E n _ m (6)) = d p (W, E ro (C)), 

as a consequence of the duality relations (6). Hence the assertion in the case W G V m (C) 
follows from the one for W G V m (C). □ 

We complement these results by a different characterization of the projective distance. 
The Hausdorff distance between W\ and W2 in Gr„ jm is defined as 

d H (W 1 ,W 2 ) = max{<(x,W 2 ) | xG VFi\{0}} . (11) 

This notion of distance evolves from the identification of a subspace W G Gr„ i?n with the 
subsphere W n 5 n_1 of the unit sphere. As the unit sphere is a metric space, also the set 
of closed subsets of S*™ -1 is endowed with a natural metric, which is known as the Hausdorff 
metric (cf. for example [23, §1-2]). For subspheres resp. subspaces via the above identification, 
this metric is given as stated in (11). 

Proposition 3.3. For W\,W 2 G Gr„ jm we have d p (Wi, W2) = sindniWi, W2). 

Proof. Let a denote the vector of principal angles between W\ and W 2 - Further, let the rows 
of Bi G R mxn form an orthonormal basis of Wj, for i = 1,2. Using the characterization of the 
smallest singular value of a matrix A via miri|| y || =1 ||Ay||, and using Lemma 2.1, we get 

cos Halloo = min II.B2-B1VII = mm 11-^2^11 = m i n lin^^ll • 
\\y\\=l x^WinS"- 1 xeWiHS"- 1 



Applying the arccosine thus yields 

||a||oo = max arccos 
xew^S"- 1 

The claim follows with (8). □ 



|a||oo = max arccos(||nvi/2^||) = max <(x, W 2 ) = dn(Wi, W 2 ) 
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4 Perturbations of balanced operators 

We provide here the proofs of the remaining results stated in the introduction. 

Suppose B G M™ xn and consider a line Rx that is not contained in W = im(B T ). What is 
the minimum norm of a perturbation A of B such that Mx C im(B T + A T )? The lemma below 
shows that, for a balanced matrix B, the answer is given by sin<(x, W). We also answer the 
analogous question with regard to kev(B). 

The Frobenius norm of a matrix A G M. mxn is defined as \\A\\p ■= ti{AA T ) 1 ^ 2 . Recall that 
\\A\\ denotes the spectral norm, that is, the largest singular value. Both matrix norms are 
invariant under the left and right multiplication with orthogonal matrices. We will frequently 
use the well known fact that ||x?/ T || = ||xy T ||,F = ||x|| ||y|| for x G M. m and y G W l . (This 
follows easily from the orthogonal invariance.) 

Lemma 4.1. Let B G R mxn be balanced and W := im(B T ). Furthermore, let x G W l \ {0} 

and put a := <(x, W), /3 := <(x, W L ) = f - a. 

1. Then for all A, A' G R mxn , we have 

xeim(B T + A T ) =>- ||A||> since, 
xGker(S + A') ^ || A'|| > sin/3 . 

2. Additionally suppose that x W^. Then there exist matrices A, A' G R mxn of rank at 
most one such that ||A||^ = sina, ||A'||i? = sin/3, andx G im(i? T +A ), x G ker(5+A'). 

Proof. 1. If x G mi{B T + A T ), then there exists v G S m ~ l and r > such that (B T + A r )w = 
rx. Then we have, as ||-B T t>|| = 1, 

||A|| > ||A T w|| = \\rx — B T v || > sin <(x, B T v) > since. 

If (B + A') • x = 0, we have, writing x° := ||x|| _1 x, 

||A'|| > ||AV|| = ||,Bx || = \\B T Bx°\\ = cosa = sin/3, 

as B T B is the orthogonal projection onto W, cf. Lemma 2.1. 

2. Without loss of generality, we may assume ||x|| = 1. Then the matrices in the lemma 
may be chosen as 

A := Bp (cos(a) x — p) T , A' := -Bxx T , 

where p := cos(a)~ 1 B T Bx is the normalized orthogonal projection of x on W. These are 
matrices of rank at most 1. Using the fact ||yz T ||ir = ||y||||-z|| we obtain for their Frobenius 
norms 

|| A\\p = \\Bp\\ ■ || cos(a) x — p\\ = sina 

and 

||A'||ir = ||-Bxx T ||^ = \\Bx\\ ■ \\x\\ = \\B T Bx\\ ■ \\x\\ = cosa = sin/3 . 

Furthermore, we have 

(B T + A T )Bx = B T Bx + (cos(a) x - p) p T B T Bx 

= cos(a) p + cos(a) • (cos(a) x — p) p T p = cos 2 (a) • x , 

which shows that x G im(B T + A T ). Moreover, we have (B + A')x = Bx — Bxx T x = 0, which 
shows that x G ker(B + A'). □ 
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Proof of Theorem 1.3. Let B G R mxn be balanced such that W = im(B T ). We shall distin- 
guish two cases. 

(i) We assume that W G V m (C). Lemma 4.1(1) implies that for any x / 0, 

inf{||A|| | x G im(B T + A T )} > sm<(x,W) . (12) 

In the case x G" W- 1 -, Lemma 4.1(2) implies that equality holds (and the infimum is attained). 
By a limit consideration it follows that equality also holds for x G W 1 - \ {0} (but the infimum 
may not be attained). It follows from the equality in (12) that 

inf{||A|| | im(B T + A T ) = inf sm<(x,W) = sm<(C,W). (13) 

x£C\{0} 

We conclude that d{B,V) = sm<(C,W), where V := {A £ R mxn | im(^ T ) fl(7/0}. 

By (2), we have d(B,F°) = mm{d(B,V),d(B,R mxn )}. But the Eckart- Young Theorem 
(cf. Remark 2.2) implies d(B, R™ xn ) = 1. Therefore we have by the definition of Renegar's 
condition number (3), 

TZ(B)- 1 = d{B, E R ) = d(B, T°) = d(B, V) = sin <(C, W). 

Here we used the assumption W G V m (C), which means B G J 7 ^. Finally, Proposition 1.6 
states that c t?(W)- 1 = sin<((7, W). Hence we conclude that tf(W) = 11(B) in the case 

W G VmiC). 

(ii) We assume now that W G V m (C), that is, -B G J 7 ^. Lemma 4.1(1) implies for x / 
that 

inf{||A'|| | x G ker(5 + A')} > sin<(x,VF ± ) 

and Lemma 4.1(2) shows that equality holds. Taking the infimum over all nonzero x G C it 
follows that 

TZ(B)- 1 = d{B,Fl) = inf {IIA'II | ker (B + A') n C + {0}} = sin <(C, W x ) . 

On the other hand, W G V m (C) implies W 1 - G V m (C) and therefore, Proposition 1.6 yields 
^(W^r 1 = sin<((7, W^). Hence we conclude that ^(W 1 ) = K C (B). Finally, due to (7), 
we get tfciW) = ^ G (W L ) = K C (B), which completes the proof of Theorem 1.3. □ 

Remark 4.2. The proof of Theorem 1.3 shows that dp(A,T, B ) = d(A, £ R ) for a balanced 
matrix A G R mxn , where d and dp denote the distances measured in the spectral and Frobenius 
norm, respectively. In fact, this equality also holds for the nonbalanced case. In the dual 
feasible case A G J-" R , a perturbation A such that A + A G J-" R and || A\\p = d(A, 

Z-'r) is given 

by A = -App T , where p G C n S 1 "- -1 is chosen such that \\Ap\\ = mm{\\Aq\\ \ q G C n S 1 ™" 1 }, 
cf. [3, Lem. 3.2]. In the primal feasible case A G J-" R , the fact that one can find rank-one 
perturbations A such that A + A G £ R and ||A||,p = d(A, E R ) follows from [24, Prop. 3.5]. 

Proof of Theorem 1.4- Let A = 55 be the polar decomposition of A G M™ xn . Then is 
balanced and S = V AA T . Since A and S have the same singular values we have ||5|| = 1 1^4)1 
and US' -1 !! = ||y4t|| (compare Section 2). By the main Theorem 1.3 we have ^(W) = 11(B). 
Thus, by the definition (3) of Renegar's condition number, the assertion of Theorem 1.4 is 
equivalent to 

d(5,S R ) " djA^j ~ Ul1 l|j4t|1 d(5,S R )' 
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or, equivalently, 

1| d(-B,£ R ) < d(A, £ R ) < \\S\\d{B,Z B ) . (14) 

To show the right-hand inequality, let B G E R be such that <i(-B,E R ) = ||5 — B\\. We 
define := SB. Then we have A G E R by the invariance of S R under the GL(m)-left action 
on R mxri . Therefore, 

d(A,E R ) < ||A-i|| < 115(5-5)11 < ||5|| \\B-B\\ = ||S||d(B,£ R ). 

For the left-hand inequality, let A G S R be such that d(^4, S R ) = ||A — A\\. We define 
5 := 5~M and note that B G £ R . Then we have 

d(B,E R ) < ||B-B|| < IIS-^A-AJII < ||5 _1 || ||A - t4|| = ||5 _1 || d(^4, £ R ). □ 

Theorem 1.3 combined with a known characterization of Renegar's condition number in 
the primal feasible case implies the following result. Let B n denote the closed unit ball in W l . 

Corollary 4.3. For W G V m (C) we have ^(Wy 1 = max{r | r • B n n W C n w (C n £„)}. 

Proof. We use the following known characterization of Renegar's condition number in the 
primal feasible case: For A G J 7 ^ and ||A|| = 1 we have (see [27] or [24, Cor. 3.6]) 

K(A)- 1 = max{r | r • B m C A(5 n n <?)} . 

Let 4 6 be balanced such that W = im(yl T ). From Theorem 1.3 and from the above 
characterization of 7£(A) -1 we get 

^{Wy 1 = max{r | r ■ B m C i4(B n n C 1 )} = max {r | r • A T (5 m ) C ,4 T A(5 n nC)}, 

where the last equality follows from the fact that the map A T : R m -»• M n is isometrical, 
cf. Lemma 2.1. Since ^4 T A is the orthogonal projection H\v : — > W, we get 

^(Wy 1 = max {r \ r ■ B n nW <Z U w (B n nC)}. □ 

Finally, for the sake of completeness, we provide a proof of the topological result stated in 
the introduction 

Proof of Proposition 1.1. (1) Note that £ := {(x,W) G 5 n_1 x Gr„ im | x G W} is a closed 
subset of 5 n_1 x Gr ni?n . The set T> m (C) is obtained as the projection of the compact set (C x 
Gr n;m ) n£ onto the second component. A standard compactness argument shows that V m (C) 
is closed. The closedness of V m (C) = t~ 1 (P„_ m (C 1 )), cf. (6), follows from the closedness of 
P n _ m (C) and the continuity of i m . 

(2) As V m {C)UV m (C) = Gr n>m , we have Gi n , m \V m (C) C X> ro (C), and thus 3P m (C) C 
E m (C). Analogously, we have dV m (C) C £ m (C). 

For the other inclusion suppose T4 7 G £ m (C), say x G W D C for some x / 0. There 
exists a sequence X& G int(C) such that ||x — Xfc|| < \ for all k > 0. Put W := W H x 1 - and 
define := lij + W. Then Wk converges to W for k — > oo. As x& G n int(C) we have 
W k G Gv n , m \P m (C) by (4) and hence W G Gr„, m \P m (C). This proves £ m (C) C 87> m (C). 
We have thus shown that dV m (C) = £ m (C). The assertion &D m (C) = E m (C) follows now 
from the duality relations (6). □ 
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5 Comparison with the GCC condition number 



The GCC condition number, introduced in [18, 10], is only defined for the cone C = R". We 
briefly compare this notion with the Grassmann condition number in this special setting. 

For p G S*™ -1 and < a < 7r we denote by cap(p, a) := {x G S™ -1 | (x,p) > cosa} 
the spherical cap with center p and angular radius a. Let A 6 R mxn with nonzero columns 
oi, . . . ,a n G R m . The GCC-condition number ^ G cc(^4) can be characterized as ^gcc(^4) _1 = 
| cosp|, where p is the minimum angle of a spherical cap containing all the points aj/||aj||, 
see [10] or [8, §6.5]. 

The following relationship between Renegar's condition number 1Z(A) and the GCC con- 
dition ^GcciA) was established in [10, Prop. 4-5]: 

^jp 11 • 11(A) < tf GCC (A) < Vn-K(A). 

By Theorem 1.4, this immediately implies, setting W := im(^4 T ), 
mim ||a ; " 



The next two examples show that this estimate cannot be substantially improved in the 
sense that both quotients '^'(W)/'^ ? GC c(A) and ^Gcci^/^iW) cannot be bounded as a func- 
tion of the dimensions m,n only. (The first example is from [3, Ex. 3.1/4.1].) Let e > and 
consider the matrices 

(2e 1 1\ ~ (l + e l+e -l + e 

Ae Vo -l i) ' Ae v -l -l i 

We have ^cc^e) = V% since the smallest enclosing cap of the three points obtained by 
normalizing the columns of A £ has the center p = (1,0) and the angular radius p = 7r/4. 

Similarly, we obtain C ^' GCC (A £ ) = /7^irnq~|~7m = f + f° r e - ^ 0. As for the Grassmann 
condition numbers, we put W £ := im^^ ) and define W £ similarly. For A £ we are in the dual 
feasible situation and W £ equals the line spanned by (1, — e, — e). Proposition 1.6 now easily 
implies that ^(W^ 1 = sin a, where cosa = ^ 1 '~^^^'°' ^ . Hence, for e — > 0. 

» (Wi )_^__^^_> (1 + 0( 1)). 
sma eV2 

Furthermore, W £ equals the span of (1, 1, —1), (1, 1, 1) and hence is independent of e. Alto- 
gether, it follows that c &(W e )/ c & GC c{A e ) and & G cc(Ae)/ e i£(W e ) are unbounded, as e — > 0. 
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